Learning to take risks

نویسندگان

  • Sandip Sen
  • Neeraj Arora
چکیده

Agents that learn about other agents and can exploit this information possess a distinct advantage in competitive situations. Games provide stylized adversarial environments to study agent learning strategies. Researchers have developed game playing programs that learn to play better from experience. We have developed a learning program that does not learn to play better, but learns to identify and exploit the weaknesses of a particular opponent by repeatedly playing it over several games. We propose a scheme for learning opponent action probabilities and a utility maximization framework that exploits this learned opponent model. We show that the proposed expected utility maximization strategy generalizes the traditional maximin strategy, and allows players to benefit by taking calculated risks that are avoided by the maximin strategy. Experiments in the popular board game of Connect-4 show that a learning player consistently outperforms a non-learning player when pitted against another automated player using a weaker heuristic. Though our proposed mechanism does not improve the skill level of a computer player, it does improve its ability to play more effectively against a weaker opponent.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

CHILDREN\'S RISKY ACTIVITIES AND PARENTS\' IDEAS ON CHILDREN\'S RISK-TAKING BEHAVIOUR

A cross-sectional study with children's and parents' self-completed questionnaires was carried out to evaluate parents' ideas on children's risk-takjng behaviours and children's risky activities after school hours by age (7 and 9 years) and sex. Nine elementary schools were randomly selected and 476 pupHs aged seven and nine years and 471 parents were studied. Most parents (90.1 %) believed tha...

متن کامل

Machine Learning and Citizen Science: Opportunities and Challenges of Human-Computer Interaction

Background and Aim: In processing large data, scientists have to perform the tedious task of analyzing hefty bulk of data. Machine learning techniques are a potential solution to this problem. In citizen science, human and artificial intelligence may be unified to facilitate this effort. Considering the ambiguities in machine performance and management of user-generated data, this paper aims to...

متن کامل

Let’s Take it to the Clouds: The Potential of Educational Innovations, Including Blended Learning, for Capacity Building in Developing Countries

In modern decentralised health systems, district and local managers are increasingly responsible for financing, managing, and delivering healthcare. However, their lack of adequate skills and competencies are a critical barrier to improved performance of health systems. Given the financial and human resource, constraints of relying on traditional face-to-face training to upskill a large and dis...

متن کامل

P170: Predictors of Test Anxiety: Perfectionism and Goal Orientation

Test anxiety as a common disorder is a physiological condition in which students experience extreme stress, anxiety and discomfort during and/or before taking a test. This kind of anxiety results from the sense of threats like fear of failure, lack of confidence and setting unattainable goals in learning and wishing to be perfect in academic situation. Resulting anxiety disrupts attention, memo...

متن کامل

Learning Styles and the Writing Process in a Digitally Blended Environment: Revising, Switching, and Pausing Behaviors in Focus

The present investigation sought to explore the relationship between learning styles and writing behaviors of EFL learners in a blended environment. It also aimed to identify the learning style types best predicting writing behaviors. Initially, the participants' preferred learning styles were identified through the Kolb’s learning style inventory (Kolb, 1984). Secondly, data were obtained thro...

متن کامل

Identifying natural gas loss risks and ranking of corrective actions

The aim of this study was to provide a new model for identifying the sources and sources of waste gas in Mahdishahr city gas department and to define corrective measures and prioritize measures to help managers to make appropriate decisions to reduce waste gas. The research method is descriptive-analytical in terms of nature and is applied in terms of purpose. The statistical sample of the rese...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002